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Abstract 

The 2-C-methyl-D-erythritol-4-phosphate (MEP) pathway provides the precursors for the biosynthesis of plastidial isopre- 
noids, which include the carotenoid pigments of many fruits. We have analysed the genes encoding the seven enzymes 
of the MEP pathway in melon (Cucumis melo L.) and determined that the first one, 1 -deoxyxylulose 5-phosphate synthase 
(DXS), and the last one, 1-hydroxy-2-methyl-2-(£)-butenyl 4-diphosphate reductase (HDR), are represented in the genome 
as a small gene family and paralogous pair, respectively. In the case of DXS, three genes encode functional DXS activi- 
ties which fall into previously established type I {CmDXSI) and II {CmDXS2a and CmDXS2b) categories, while a fourth 
DXS-like gene belonging to the type III group did not encode a protein with DXS activity. Their expression patterns and 
phylogenies suggest that CmDXSI is functionally specialized for developmental and photosynthetic processes, while 
CmDXS2a and CmDXS2b are induced in flowers and ripening fruit of orange- (but not white-) fleshed varieties, coinciding 
with p-carotene accumulation. This is the first instance connecting type II DXS genes to specialized isoprenoid biosyn- 
thesis in the fruit of an agronomically important species. Two HDR paralogues were shown to encode functional enzymes, 
although only CmHDRI was highly expressed in the tissues and developmental stages tested. Phylogenetic analysis 
showed that in cucurbits such as melon, these HDR paralogues probably arose through individual gene duplications in a 
common angiosperm ancestor, mimicking a prior division in gymnosperms, while other flowering plants, including apple, 
soy, canola, and poplar, acquired HDR duplicates recently as homoeologues through large-scale genome duplications. 
We report the influence of gene duplication history on the regulation of the MEP pathway in melon and the role of special- 
ized MEP-pathway isoforms in providing precursors for p-carotene production in orange-fleshed melon varieties. 

Keywords: 2-C-methylerythritol 4-phosphate, carotenoids, fruit development, isoprenoid biosynthesis, melon, metabolite 
profiling, phylogenetics, transcript profiling. 



Introduction 

Melon ( Cucumis melo L.) is an economically important spe- 
cies in the family Cucurbitaceae, which also includes cucum- 
ber (C. sativus), squash {Cucurbita spp.), and watermelon 
(Citrullus lanatus). It is one of the most valuable fleshy fruits 



for fresh consumption in North America and Europe due 
to its distinct flavor and sugar content. Besides sweetness, 
the shelf life, colour, and aroma of melon are all important 
traits which have an impact on fruit quality. The latter two 
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depend on the synthesis of isoprenoids, a ubiquitious fam- 
ily of natural compounds which play numerous central meta- 
bolic roles in all plants as precursors for protective pigments 
(carotenoids), hormones (brassinosteroids, gibberelHns, str- 
igolactones, cytokinins, and absiscic acid) and the side chains 
of chlorophylls, tocopherols, and quinone redox co-factors 
(Bouvier et a/., 2005). 

In melon fruit, non-climacteric white-fleshed varietals such 
as 'Piel de Sapo' (PS) generally lack carotenoid pigments, 
whereas (3-carotene, a nutritionally important carotenoid 
that serves as the main precursor for vitamin A synthesis, 
accumulates in climacteric orange-fleshed varieties such as 
'Vedrantais' (Ved) and Dulce (Dul), resulting in their char- 
acteristic orange hue. These carotenoid-rich and -deficient 
melon strains provide a convenient comparative system for 
understanding the molecular regulation of carotenoid forma- 
tion in fruit. However, an investigation of the upstream steps 
providing carotenoid precursors has not previously been 
undertaken. 

Of the two pathways in plant cells which synthesize iso- 
prenoid precursors, the 2-C-methyl-D-erythritol 4-phosphate 
(MEP) pathway is thought to be the main supplier of pre- 
cursors for carotenoids (Rodriguez-Concepcion, 2010). This 
pathway, as well as the cytosolic mevalonate pathway, pro- 
vides the universal C5 intermediates isopentenyl and dimethy- 
lallyl diphosphate (IPP and DMAPP) which form the basis 
for all higher isoprenoid biosynthesis (Hunter, 2007; Phillips 
et al., 2008; Cordoba et al., 2009). In this pathway, pyruvate 
and glyceraldehyde 3-phosphate are condensed through the 
action of 1-deoxyxylulose 5-phosphate synthase (DXP syn- 
thase or DXS). Following the reductive isomerization of DXP 
to MEP by DXP reductoisomerase (DXR), it undergoes five 
additional enzymatic steps en route to the formation of IPP 
and DMAPP. The last of these steps is a reductive dehydroxy- 
lation to form both of these products from l-hydroxy-2-me- 
thyl-2-(£')-butenyl 4-diphosphate (HMBPP) via HMBPP 
reductase (HDR). 

DXS is typically encoded in plant nuclear genomes as a 
small gene family (Grassi et al., 2013; Peng et al., 2013). 
It is now well established that multiple clades of function- 
ally specialized DXS classes exist in plants (Walter et al. , 
2002). This distinction is based on phylogenetic analysis, 
biochemical characterization, and gene expression pat- 
terns and has been observed in angiosperm and gym- 
nosperm plant lineages (Kim et al., 2006; Phillips et al., 
2007; Vallabhaneni and Wurtzel, 2009; Cordoba et al, 
2011). Type I DXS genes are constitutively expressed in 
green tissues and are thought to supply the precursors 
for housekeeping and photosynthetic metabolites such as 
carotenoids and chlorophyll. Type II DXS genes are usu- 
ally expressed in specialized contexts such as apocarote- 
noid-accumulating roots colonized by mycorrhizas (Walter 
et al., 2000) or in the resin duct epithelial cells of coni- 
fers in response to bark beetle attack or fungal infection 
(Phillips et al., 2007). A third clade of DXS-like sequences 
has recently been proposed (Cordoba et al., 2011). DXS 
activity has not been definitively demonstrated for this 
group, but in maize, the expression of one member of this 



clade, ZmDXS3, correlates with endosperm carotenoid 
accumulation (Vallabhaneni and Wurtzel, 2009). 

While the evolutionary origin of DXS from eubacterial 
sources has been investigated (Lange et al. , 2000; Krushkal 
et al, 2003), the origin and subfunctionalization of DXS 
paralogues within plant genomes remains poorly understood. 
HDR, hke DXS, is often encoded in plant genomes as mul- 
tiple copies (Kim et al, 2008; Kim et al, 2009; Grassi et al., 
2013; Kang et al., 2013), while the five intervening enzymes 
are encoded by singleton genes with few exceptions (Seetang- 
Nun et al., 2008). Gene duplications leading to multiple cop- 
ies of a gene may arise through several unrelated mechanisms, 
including retroposition, unequal crossing over, segmental or 
chromosomal duphcations, and polyploidy (Zhang, 2003; 
Shoemaker et al., 2006). Multiple models for the retention 
and evolution of gene duplicates have been proposed (Innan 
and Kondrashov, 2010), ultimately culminating in functional 
specialization (Veitia et al., 2008), pseudogenization, or 
deletion. 

We have taken advantage of the recently published melon 
genome (Garcia-Mas et al., 2012) to determine whether the 
genome architecture of MEP-pathway paralogues in melon 
could help explain the complex process of fruit carotenoid 
biosynthesis. We report that small-scale gene duplications in 
the MEP pathway have led to specialized isoforms in melon 
which play an important role in providing carotenoid precur- 
sors during fruit ripening, including the first report of type II 
DXS proteins involved in this process. 

Materials and methods 

Plant material 

Tissue from three different melon varieties were used: the non-cli- 
macteric/white flesh melon type Piel de Sapo (PS) from the inodorus 
group and the climacteric/orange flesh types Vedrantasi (Ved) and 
Dulce (Dul) from the cantalupensis and reticulatus groups, respec- 
tively. Plants were grown under the same greenhouse conditions as 
previously described (Eduardo et al, 2005). Flowers (one per plant) 
were hand pollinated to control fruit stages. Fruits were collected 
at four different stages covering fruit development and ripening 
[15, 25, and 35 days after pollination (DAP) and harvest; 45 DAP 
for Ved and 55 DAP for PS]. Flesh-mesocarp from three different 
fruit replicates of each stage was extracted from the middle of the 
fruit, avoiding rind, seeds, and jelly, and then frozen immediately in 
liquid nitrogen and stored at -80°C. Vegetative tissues (leaves, flow- 
ers, stems, and roots) were also frozen in liquid nitrogen and kept at 
-80°C. For light-induction experiments, Ved seeds were germinated 
in the dark for 4 days, transferred to the light (120 |iEinstein-m"^-s"') 
for 4, 8, 12, or 24h and then flash frozen in liquid nitrogen. 

Gene expression analysis by quantitative RT-PCR 

Total RNA samples were extracted using the Qiagen (Hilden, 
Germany) RNeasy Plant Mini Kit*. RNA quantity and quality were 
evaluated using a NanoDrop ND-1000 spectrophotometer (Thermo 
Scientific, Waltham, MA, USA) and assessed by agarose gel electro- 
phoresis. RNA was then treated with RNAse free TURBO-DNase 
I (Turbo-DNA-/f-et>'^^ Kit; Life Technologies, Ambion*, Carisbad, 
CA, USA) for 30min at 37°C before being used as a template for 
cDNA synthesis. First-strand cDNA synthesis was performed using 
1 |j,g total RNA, Superscript^"^ III First Strand Synthesis (Invitrogen 
Life Technologies, Carlsbad), and an oligo(dT)2o primer according 
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to the manufacturer's instructions. The synthesized cDNA was 
diluted 1:5 with pure water, and 2 |.il were used in subsequent reac- 
tions. Quantitative RT-PCR (QPCR) assays were performed on a 
LightCycler* 480 Real-Time PGR System (Roche Applied Science, 
Penzberg, Germany) using SYBR® Green I Mix (Roche Applied 
Science). Relative transcript levels of DXS and HDR isoforms were 
calculated using the primer efficiency-corrected relative quantifica- 
tion model (Pfaffl, 2001) to compare a single gene in different tis- 
sues or the AACt model to compare multiple genes in light induction 
or fruit development. In these cases, relative transcript levels were 
calculated using the median gene expression of all DXS and HDR 
genes surveyed. cDNA loading was normalized using three reference 
genes: cyclophilin (CmCYP7\ MELO3C025848) (Gonzalez-Ibeas 
et al., 2007; Mascarell-Creus et al, 2009), adenine phosphoribosyl- 
transferase (CmAPT; MELO3C013962), and actin 2/8 (CmACT; 
MELO3C023264). Primers for amplification of target and reference 
genes were designed with Primer Express® Software v2.0 (Applied 
Biosystems, Foster City, CA, USA). Primer sequences are listed in 
Supplementary Table 1 . Intra-assay variation was evaluated by per- 
forming all amplification reactions in triplicate. The amplification 
protocol consisted of an initial step at 95°C for lOmin, and 45 cycles 
at 95°C for 10 s and 60°C for 30 s. The specificity of the PCR ampU- 
fication was confirmed by melting curve analysis and by sequenc- 
ing of the PCR amplicons. Reverse transcriptase minus controls 
(RT-) were performed once for each RNA sample to determine the 
amount of contaminating genomic DNA, and non-template con- 
trols (NTC) were included in each plate to monitor the presence of 
DNA contamination and the formation of primer-primer dimers. 



Phylogenetic analysis of DXS and HDR proteins 
HDR and DXS protein sequences were downloaded from the NCBI 
database. Where indicated, sequences were obtained from genome- 
sequencing databases. Putative chloroplast transit peptide cleavage 
points were calculated using the ChloroP 1.1 Server (Emanuelsson 
er al., 1999) and after manual curation, they were excluded for the 
purposes of sequence analysis. The phylogenetic analysis was per- 
formed on the Phylogeny.fr platform (www.phylogeny.fr). Sequences 
were aligned with MUSCLE (v3.7) with default settings. The phy- 
logenetic tree was reconstructed using the maximum likelihood 
method implemented in the PhyML program (v3.0 aLRT). The 
default substitution model was selected assuming an estimated pro- 
portion of invariant sites of 0 and 4 gamma-distributed rate catego- 
ries to account for rate heterogeneity across sites. The gamma shape 
parameter was estimated directly from the data (gamma - 0.769). 
Reliability for internal branches was assessed using the aLRT test 
(SH-like) (Anisimova and Gascuel, 2006). Graphical representation 
and editing of the phylogenetic trees were performed with Interactive 
Tree of Life (iTOL) (Letunic and Bork, 2010). For SNP analysis of 
coding and cw-regulatory regions, the gene sequences of CmDXS2a, 
CmDXS2b, and CmHDRl from the PS and Ved varieties, including 
the promoter (2kb upstream of the start codon) and 3'UTR (Ikb 
downstream of the stop codon), were retrieved from their previously 
available re-sequenced genomes (Sanseverino et al., unpublished) 
after mapping reads against the DHL92 melon reference genome 
(Garcia-Mas et al., 2012). SNP and short indels were extracted with 
SUPER V 0.9 (Sanseverino et al, unpublished) using the algorithms 
of BWA (Burrows-Wheeler Aligner) (Li and Durbin, 2010) (http:// 
bio-bwa.sourceforge.net) and SAMtools (Sequence Alignment/Map) 
(Li et al., 2009) (http://samtools.sourceforge.net) using a read depth 
>10 and a base-calling quality cutoff >30. Predicted transcriptional 
start sites and differences in transcription factor binding sites between 
PS and Ved were determined for these three genes using the PLACE 
database (Higo et al., 1999) (http://www.dna.afifrc.go.jp/PLACE/). 

l-iPLC analysis 

The isoprenoid pigment content of the melon fruit samples 
was examined by HPLC-DAD-FLD. Except where noted, all 



chemical reagents were obtained from Sigma-Aldrich (St. Louis, 
MO, USA). All samples were protected from light and heat dur- 
ing all steps of the extraction procedure. 5mg lyophilized melon 
tissue were extracted with agitation for 15min at 4°C with 
1ml chilled hexane:acetone:methanol (2:1:1) containing 6 \ig 
canthaxanthin-ml ', followed by the addition of 100 )xl water and 
further extraction for 20 s. Samples were centrifuged at maximum 
speed in a chilled bench top microcentrifuge (Eppendorf 5418R, 
Hamburg, Germany) for 3 min. The upper organic phase was aspi- 
rated and set aside while the pellet was extracted a second time with 
the same solvent lacking canthaxanthin. Following the same extrac- 
tion and centrifugation procedure, the organic phases were pooled 
and dried in a vacuum centrifuge (Eppendorf Concentrator Plus). 
Dried residues were resuspended in 150 |.il ethyl acetate and filtered 
through a 0.2 \im PTFE microfilter (Supelco). A 10 \i\ aliquot of 
each sample was injected onto an Agilent Technologies (Santa 
Clara, CA, USA) 1200 series HPLC system equipped with a diode 
array and a 250mni x 4.6mm C30 reverse phase column (YMC, 
Kyoto, Japan) equilibrated with methanohwater (99:1) containing 
0.2% ammonium acetate (solvent A). Metabolites were separated 
with the following gradient: isocratic separation in 100% solvent 
A, followed by a step to 15% solvent B (tert butyl methyl ether) at 
12 min and a linear gradient to 100% B by 38 min. Following a 6 min 
hold at 100% B, the column was returned to initial conditions and 
equilibrated for an additional 6 min. The flow rate was maintained at 
1 ml-min"'. Isoprenoid pigments were monitored at 472 and 650 nm 
and compared to the retention times of chlorophyll and P-carotene 
authentic standards. Tocopherol quantification was accomplished 
through fluorescence detection (excitation 290 nm, emission 330 nm) 
and comparison to a standard curve constructed from a-tocopherol. 



Cioning DXS and l-IDR isoforms 

In order to verify enzymatic activity of DXS and HDR gene fam- 
ily members, a combination of in vitro enzyme assays and bacte- 
rial complementation assays were carried out. In both cases, the 
corresponding genes bearing plastidial transit peptides were PCR 
amplified without their transit peptides to improve solubility in 
heterologous expression systems. All cloning steps were carried out 
using the Gateway^"^ cloning system (Invitrogen Life Technologies). 
Sequences of the corresponding attBl and attB2 primers used 
for each DXS or HDR gene are listed in Supplementary Table 1. 
Truncated amplicons lacking the portions of the full-length coding 
sequences encoding transit peptides were thus amplified bearing attB 
adaptors. The PCR programme consisted of a 3 min denaturation 
step at 95°C followed by four cycles of low-stringency PCR (96°C 
for 10 s, 48°C for 45 s, and 72°C for 2min). This priming stage was 
followed by 32 cycles of PCR as follows: 96°C for 10 s, 63°C for 45 
s, and 72°C for 2 min. The 50 |il PCRs included 1 nl pooled cDNA 
from various Ved organs, 0.25 \il AccuPrime^"^ high fidelity TAQ 
(Invitrogen Life Technologies), 10 pmol attBl and attB2 primer, 
and IX AccuPrime^*^ buffer I. PCRs were examined on 2% aga- 
rose gels, and reactions with a single product of expected size were 
used without further purification in a 10 |il BP clonase II reaction 
(Invitrogen Life Technologies) containing ~150ng PCR product and 
150ng pDONR207 entry vector. Following an overnight reaction 
at room temperature, recombined products were transformed into 
TOP 10 chemically competent cells by heat shock and plated onto 
LB media containing 50 |j.g-mL' gentamycin. Colonies were selected 
and grown in liquid media containing the same antibiotic. Plasmid 
DNA was purified using the QIAprep Spin Miniprep kit (QIAGEN). 
Multiple inserts of each gene were fully sequenced and compared 
to the published melon genome sequence. Purified plasmid DNA 
was diluted to 150 ng-mL' and used in an LR clonase II recombina- 
tion reaction (Invitrogen Life Technologies) with either a Gateway- 
ready version of the pET28 (for in vitro enzyme assays) or pET32 
(for bacterial complementation) plasmids (Clontech Laboratories, 
Mountain View, CA, USA), transformed as before, and plated on 
LB media containing either carbenicillin at 100 )j,g-mL' (pET32) 
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or kananiycin at 50 |xg-ml"' (pET28). The resulting transformants 
were grown in 25 ml LB media containing the appropriate antibiotic 
and purified as described above to generate stocks of each gene in 
expression vectors. 

Bacterial complementation 

The Escherichia coli HDR mutant strain MG1655 araOispH 
(McAteer et al, 2001) was used to verify activity of CmHDRl 
and CmHDR2 as described previously (Hsieh and Goodman, 
2005). The araoispH strain was maintained on LB media contain- 
ing 50 |ig-mr' kanamycin and 0.2% (w/v) arabinose. A modifed 
pET32 vector (Yu and Liu, 2006) bearing either melon CmHDRl 
or CmHDRl was introduced into competent araoispH cells and 
selected on LB plates containing 0.2% arabinose, 50 |j.g-mL' kana- 
mycin, and 100 |ig-mL' carbenicillin. Colonies were transferred to 
plates containing the same antibiotics but 0.2% (w/v) glucose in 
place of arabinose to suppress the expression of endogenous HDR 
activity in this modified strain. As controls, the same strain was 
transformed with the pQE30 plasmid containing E. coli HDR (posi- 
tive control) and pET32 containing spruce DXSl (negative control). 

DXS activity was first evaluated using a dxs deficient strain of 
E. coU which had been engineered to synthesize isopentenyl and 
dimethylallyl diphosphate (IPP and DMAPP) from mevalonate 
(Campos et al, 2001). Competent cells grown in the presence of 
ImM mevalonate were transformed with purified pET32 destina- 
tion vectors containing a candidate cDNA (CmDXSl, CmDXS2a, 
CmDXS2b, CmDXS3lDXL, or control) and plated on LB media 
containing ImM mevalonate and 100 |j.g-mL' carbenicillin. To test 
for a functional DXS gene, transformed colonies were transferred to 
LB-carbenicillin plates lacking mevalonate and incubated at 37°C 
overnight. 



In vitro DXS enzyme assay 

CmDXS3/DXL activity was further examined by in vitro assay 
following expression in E. coli strain BL21 Rosetta (Novagen 
Biosciences, Madison, WI, USA) using the pET28 vector. Following 
transformation of competent cells, 10 ml cultures of OverNight 
Express media (Novagen Biosciences) containing 10ml glycerol-L' 
and 50 |ig-mL' kanamycin were inoculated with a single colony and 
grown for 48 h at 28°C. Stationary cultures were harvested by cen- 
trifugation at 2000g for 20min at 4°C. The pellet was resuspended in 
ImL 50 mM Tris pH 8.0 containing 5% (v/v) glycerol, 50 mM KCl, 
25 mM MgC12, 1 mM ascorbic acid, 100 |xM thiamin pyrophosphate 



(TPP), 0.2% (v/v) Tween-20, and bacterial protease inhibitor cock- 
tail (Sigma Aldrich) diluted 1:100. The resuspended pellet was sub- 
jected to two freeze-thaw cycles using ethanolic ice and a water bath 
and then sonicated using a Branson Digital Sonifier (6min total, at 
25% intensity, with a 1 s duty cycle and 1 s rest between pulses). The 
lysed cultures were centrifuged at 16 OOOg- and 4°C, and the superna- 
tants were filtered through a 0.45 \xm acetate syringe filter. Protein 
concentrations were judged by comparison to a BSA standard curve 
constructed using Bradford reagent according to the manufactur- 
er's instructions (Bio-Rad Laboratories, Hercules, CA, USA). All 
protein samples were adjusted to 0.5 mg-mL'. A 30 |il ahquot of 
each crude protein sample was combined with 70 \A DXS reaction 
buffer. This buffer was identical to the resuspension buffer except it 
contained TPP at 1 mM and sodium pyruvate and glyceraldehyde 
3-phosphate at 14.28 mM each (final concentrations lOmM). Upon 
combining purified proteins with reaction buffer, the reactions were 
incubated at 25°C for 2h. Reactions were halted by addition of 
100 |jl CHCI3, followed by 10 s vigorous vortexing and Imin cen- 
trifugation. DXP in the upper aqueous fraction was analysed by 
LC-MSMS as described in Wright and Phillips (2014). 

Results 

Identification of MEP pathway genes in melon 

The MEP pathway provides precursors for isoprenoid bio- 
synthesis and has an impact on important fruit traits such 
as carotenoid production in melon. Based on a search of 
the recently published melon genome using orthologous 
sequences from other plant species, we identified and cloned 
the full complement of MEP-pathway genes (Table 1). 
Individual sequencing of isolated cDNAs from the orange- 
fleshed Ved variety indicated that all MEP pathway genes 
matched their sequences with the melon reference genome 
DHL92. The white-fleshed PS variety, in which the fruit does 
not produce carotenoids, was included in our analysis for 
comparative purposes. Whole genome re-sequencing of the 
same coding sequences from PS confirmed a near-identical 
correspondence at the nucleotide level and no amino acid 
differences in MEP-pathway enzymes compared to Ved. Of 
the seven steps in the pathway, five are catalysed by enzymes 



Table 1 . Melon genes of the MEP pathway 



Gene 


MELOGEN ID 


Full length 
(aa) 


Mature length 
(aa)" 


% similarity to 

Arabidopsis 

orthologue'^ 


Genome scaffold and 
linkage group 


CmDXSl 


MELO3C021375 


718 


655 


89 


CM3.5_scaffold00047 (XI) 


CmDXS2a 


MELO3C017499 


732 


672 


72 


CM3.5_scaffold00030 (II) 


CmDXS2b 


MELO3C014965 


721 


687 


75 


CM3.5_scaffold00023 (VI) 


CmDXS2b' 


MELO3C01 4966/3C01 4967 


665" 


628 


64 


CM3.5_scaffold00023 (VI) 


CmDXS3/DXL 


MELO3C01 1 230 


668 


625 


60 


CM3.5_scaffold00014(lll) 


CmDXR 


MELO3C026292 


469 


401 


90 


CM3.5_soaffold00089 (II) 


CmMCT 


MELO3C016431 


304 


241 


74 


CM3.5_soaffold00028 (VI) 


CmCMK 


MELO3C015605 


394 


347 


78 


CM3.5_scaffold00025 (II) 


CmMDS 


MELO3C016339 


230 


182 


86 


CM3.5_scaffold00027 (VII) 


CmHDS 


MELO3C005061 


741 


697 


88 


CM3.5_scaffold00004 (XII) 


CmHDRl 


MELO3C01 8407 


463 


415 


82 


CIVI3.5_scaffold00034 (1) 


CmHDR2 


MELO3C021949 


466 


419 


76 


CM3.5_soaffold00050 (XI) 



^ Estimated from a reconstruction of the coding sequence. Estimated from ChloroP predictions and multiple sequence alignments. Putative 
transit peptides were excluded from tliis calculation, aa, amino acids. 
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encoded by single-copy genes in melon, whereas multiple iso- 
forms code for the first and last enzymes of the pathway, DXS 
and HDR, respectively (Table 1). Based on ChloroP analysis 
of their deduced amino acid sequences, all the encoded pro- 
teins contain a transit peptide which directs the pseudoma- 
ture protein to the plastid. 

Only three of the five predicted melon DXS genes 
encode functional DXS enzymes 

Four full-length sequences showing similarity to DXS 
were found in the melon genome (Table 1). Construction 
of a phylogenetic tree which includes these four DXS can- 
didates as well as other known plant DXS genes indicated 
that only one melon sequence grouped with the type I DXS 
clade (Fig. 1) and was denoted CmDXSl. Two grouped with 
type II sequences, hereafter referred to as CmDXSla and 2b. 
CmDXSla and 2b did not form their own clade amongst 
other angiosperm type II sequences and instead fell into dis- 
tinct subclasses of type II DXS (Fig. 1, clades A and B). The 
hierarchical structure of the type II clade suggests multiple 
rounds of DXS gene duplication have taken place in plants. 
For example, a similar subdivision among gymnosperm type 
II proteins is evident (Fig. 1, clades C and D). However, this 
hierarchy also indicates that type II gymnosperm proteins of 
clade C are more closely related to angiosperm sequences of 
clade A than to their own paralogues (group D). The DXS 
type II B clade includes both mono- and dicotyledonous spe- 
cies and is distinct from the common branch which yields the 
A and C clades, consisting of angiosperm and gymnosperm 
sequences, respectively. Excluding the transit peptide, the two 
melon type II sequences share 76% amino acid identity (90% 
similarity) while CmDXS 1 shares 72 and 75% with CmDXS2a 
and 2b, respectively (86 and 87% similarity). The fourth DXS 
candidate grouped with a third clade of DXS-like sequences 
(type III; Fig. 1 ) with largely unknown functions and was des- 
ignated CmDXS3/DXL (DXS-like). Its deduced amino acid 
sequence possesses 60, 55, and 57% identity to CmDXSl, 2a, 
and 2b, respectively (76, 74, and 73% similarity). 

A fifth DXS candidate initially identified as two sepa- 
rate coding sequences by automatic genome annotation 
(MELO3C014966 and MELO3C014967) is apparently the 
product of tandem gene duplication of CmDXS2b and is 
located directly adjacent to it {CmDXS2h'). However, it has 
been interrupted by an LTR transposable element (position 
5573690-5577498 of the reference genome), and quantita- 
tive RT-PCR assays using specific primers failed to detect the 
presence of transcripts in any tissue or developmental stage 
(data not shown). From this we conclude CmDXS2h' to be a 
non-functional gene. This transposon insertion was present 
in both Ved and PS melon types. 

To verify the predicted enzymatic function for the four full- 
length candidates, we employed a J.w-deficient mutant strain 
of E. coli that had been engineered to convert mevalonate into 
IPP and DMAPP (Campos et ai, 2001). When transformed 
with a plasmid containing a functional DXS gene, the strain 
can grow without mevalonate. Thus, when this strain was 
transformed with a pET32 plasmid encoding Norway spruce 



DXSl (PhilHps et al, 2007) as a positive control, CmDXSl, 
CmDXS2a, or CmDXS2b transformants grew in the absence 
of mevalonate (Fig. 2A). However, no growth was observed 
on mevalonate-free media when the same strain was trans- 
formed with melon DXR (negative control). The dxs mutant 
also failed to grow on media lacking mevalonate when trans- 
formed with pET32 bearing CmDXSSIDXL (Fig. 2A). An 
additional attempt to detect DXS activity for this protein 
was made using a highly sensitive in vitro LC-MS/MS based 
DXS assay which detects DXP formation directly. CmDXS3l 
DXL was subcloned without its predicted transit peptide 
into the pET28 vector for expression in E. coli. Cultures har- 
bouring the same plasmid encoding the native E.coli DXS 
or non-transformed cells were used as positive and negative 
controls, respectively. Crude cell-free protein extracts of cul- 
tures overexpressing E. coli DXS demonstrated high levels 
of DXP formation under optimized in vitro assay conditions 
(Fig. 2B), while non-transformed crude extracts produced a 
lower but easily quantifiable level of DXP due to endogenous 
DXS activity. Surprisingly, total protein extracts of cultures 
expressing CmDXS3/DXL produced no detectable DXP 
under the same conditions. One possible explanation for 
this apparent inhibitory effect could be that CmDXS3/DXL 
catalyses a distinct but related reaction which may involve 
one of the two substrates required by DXS. Given the lack of 
demonstrable DXS activity, we refer to CmDXS3/DXL here- 
after as CmDXL (for DXS-Hke protein), following previous 
nomenclature recommendations (Phillips et al, 2008). 

Two genes encode functional HDR enzymes in melon 

Two sequences showing similarity to known plant HDRs were 
found in the melon genome and designated CmHDRl and 
CmHDR2 (Table 1). The melon HDRl and HDR2 proteins 
share 78% amino acid identity and 90% similarity (exclud- 
ing transit peptides). A phylogenetic analysis of HDR genes 
across the plant kingdom shows that the presence of multiple 
copies in a genome is a fairly common occurrence (Fig. 3). 
Unhke the well separated clades in the tree constructed from 
DXS protein sequences (Fig. 1), the phylogenetic distribu- 
tion of HDR sequences is less evident. In species with multi- 
ple HDR genes, we detected two separate patterns. The first 
includes angiosperms with two HDR isoforms which resem- 
ble each other more than orthologous sequences from related 
species (Fig. 3, inset). For example, banana, apple (Velasco 
et al, 2010), soybean (Schmutz et al, 2010), canola (Wang 
et al., 201 1), and poplar each possess two HDR genes which 
form a species-specific subclade relative to their nearest neigh- 
bours. Each of these species has undergone at least one recent 
whole-genome duplication (WGD) event (Parkin et al, 2003; 
Tuskan et al, 2006; Schlueter et al, 2007; Han et al, 2011; 
D'Hont et al, 2012). This fact, coupled to the high degree 
of similarity between paralogous pairs, suggests these HDR 
isoforms may be homoeologues produced by polyploidy. The 
second pattern, which includes spruce, ginkgo, pine, melon, 
watermelon, and cucumber, features separate HDR classes in 
which member similarity is higher than the similarity between 
paralogous pairs within a species. Gymnosperm (Fig. 3A and 
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Fig. 1. Phylogenetic analysis of DXS protein sequences. Sequences were aiigned witln IVIUSCLE (v3.7). Construction of the pliylogenetic tree was 
based on tlie maximum iil<;eiiliood method of the PhylVIL program (v3.0 aLRT). Graphical representation and edition of the phylogenetic tree were 
performed with Interactive Tree of Life (ITOL). DXS1 , DXS2 and DXS3 clades are represented in purple, green, and yellow, respectively A and B signify 
angiosperm type II subclasses, while C and D represent similar subclasses for gymnosperm type II proteins. Melon DXS sequences are boxed. Internal 
branching reliability (values not shown) was established using the aLRT test. The sequences used in the tree were as follows: AaesDXS2: ABK35283.1 
Adonis aestivalis var palaestina; AannDXSI : AAD56390.2 Artemisia annua; AmajDXS2: AAW28999.1 Antirrhinum majus; ApanDXSI : AAP14353.1 
Andrographis panicuiata; AsinDXS3: AFU75320.1 Aquiiaria sinensis; AthaDXLI : At3g21 500 /^rab/dops/s thaiiana; AthaDXS/CU\1 : At4g1 55560 
A. thaiiana; AthaDXS3/DXL2: At5g11380A. thaiiana; AvilDXSI : ACR02668.1 Amomum villosum: BrapDXSI : ABE60813.1 Brassica rapa; CannDXSI: 
Y1 5782 Capsicum annuum; CmelDXSI : MELO3C01 1 230 Cucumis melo; CmelDXS2a: MELO3C01 7499 C. melo; CmelDXS2b: MELO3C01 4965 
C. melo; CmelDXS3/DXL: MELO3C021375 C. melo; CmorDXS2: BAE79547.1 Chrysanthemum x morifolium; CrosDXS2a: CAA09804.2 Catharanthus 
roseus; CrosDXS2b: ABI35993.1 C. roseus; CsteDXSI : BAF75640.1 Croton stellatopilosus; DradDXS (outgroup): NP_295198 Deinococcus 
radiodurans; EguiDXSI : AAS99588.1 Elaeis guineensis; HbraDXSI : AAS941 23.1 Hevea brasiliensis; HbraDXS2: ABF1 8929.1 H. brasiliensis; GbarDXSI : 
ABN13970.1 Gossypium barbadense; GbilDXSI : AAS89341 .1 Ginkgo biloba; GbilDXS2: AAR95699.1 G. biloba; GmaxDXSI : AC072582.1 Glycine max; 
GmaxDXS3: XP_003527796.1 G. max; McitDXS2: AAL32062.1 Morinda citrifolia; MpipDXS2: AAC33513.1 Mentha x piperita; MtruDXSI : GAD22530 
Medicago truncatula; MtruDXS2: CAN89181.1 M. truncatula; MtruDXS3: XP_003603440.1 Medicago truncatula; NpseDXS2: CAC08458.1 Narcissus 
pseudonarcissus; NtabDXSI : CBA1 2009.1 Nicotiana tabacum; NtazDXS2: ADD82535.1 Narcissus tazettavar. Chinensis; OsatDXSI: NP_001 055524.1 
Oryza sativa Japonica; OsatDXS2: NP_001 059086.1 O. sativa Japonica; OsatDXS3: BAA83576 O. sativa Japonica; PablDXSI : ABS5051 8 P/cea abies; 
Pabi DXS2a: ABS50519 P abies; PabiDXS2b: ABS50520 P. abies; PdenDXSI : ACC54557.1 Pinus densifiora; PdenDXS2: ACC54554.1 P densiflora; 
PmonDXSI: AAQ84169.1 Pueraria montana var. Lobata; PpatDXS2: XP_001 756357.1 Physcomitrella patens subsp. patens; PtaeDXS2a: ACJ67020.1 
Pinus taeda; PtaeDXS2b: ACJ67021.1 P taeda (originally annotated as type I DXS); PtriDXSI: XP_002312717.1 Populus trichocarpa; PtriDXS2a: 
XP_002331 678.1 P trichocarpa; PtriDXS2b; XP_00230341 6.1 P trichocarpa; PtriDXS3: XP_002308644.1 P trichocarpa; RcomDXSI : XP_00251 6843.1 
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Fig. 2. Confirmation of enzyme activity in meion DXS and HDR genes. 
(A) Complementation of a dxs- £ coli mutant witli melon DXS isoforms. 
The dxs- strain can grow normally in the presence of 1 mM mevaionate 
but requires transformation with a plasmid encoding a functional DXS 
gene for viability on mevalonate-free media. Picea abies DXS1 (PaDXSI) 
is included as a positive control (Phillips et al., 2007) while melon 
1 -deoxyxyulose 5-phosphate reductoisomerase (CmDXR) was used as a 
negative control. (B) Crude lysates of £ coli overnight cultures expressing 
CmDXS proteins were used in DXS in vitro reactions with pyruvate, 
glyceraldehyde 3-phosphate, and all necessary co-factors. DXP was 
analysed by LC-MS/MS. The non-transformed control (NT) indicates 
endogenous DXS levels. (C) complementation of the lidr £ coli mutant 
MG1655 araoispH (McAteer et al. , 2001) with melon HDR isoforms. The 
LB-carbenicillin-kanamycin plate on the left also contains 0.2% arabinose, 
which induces expression of the endogenous HDR gene in this mutant 
strain. The plate on the right contains the same antibiotics and 0.2% 
glucose, which represses endogenous HDR expression. Under these 
conditions, the strain is viable only when transformed with a plasmid 
encoding afunctional HDR gene. 



B) and cucurbit (Fig. 3C and D) HDR paralogues formed 
independent subclades each consisting of type I and type 
II isoforms. Interestingly, none of the genomes in this sec- 
ond group has undergone recent genome duphcation events 
(Ahuja, 2005; Huang et al., 2009; Garcia-Mas et al., 2012; 
Guo et al., 2013; Nystedt et a/., 2013). The lack of polyploidy 
or partial genome duplications among these species and the 
greater phylogenetic distance between their paralogues, in 
contrast to the first group, implies such paralogues may have 
arisen through individual gene duplications. 

The activity of both melon HDR isoforms was affirmed 
using a bacterial complementation assay. The MG1655 
araoIspH mutant strain expresses an endogenous HDR 
under the control of the inducible araBAD promoter, which 
is alternately induced by arabinose or repressed by glucose 
(McAteer et al., 2001). In the latter case, the strain requires 
transformation with a plasmid encoding a functional HDR 
for viability. We transformed competent cells made from this 
strain using both melon HDR isoforms as well as the E. coli 
HDR as a positive control. Once transferred to 0.2% glucose 
media, this mutant strain grew normally when transformed 
with either CmHDRl or CmHDR2 (or with the positive con- 
trol) but not when transformed with CmDXL (here used as 
a negative control) (Fig. 2C). These results demonstrate that 
both CmHDRl and CmHDR2 isoforms are active HDR 
enzymes, in agreement with previous reports of HDR paralo- 
gous pairs from gymnosperms (Kim et al., 2008). 

Differential expression of transcripts for different DXS 
and HDR isoforms suggests functional divergence 

To address the possibility of functional divergence among 
melon DXS and HDR isoforms, we first carried out a com- 
prehensive characterization of their expression patterns 
in different organs of adult melon plants from pale and 
orange varieties. In order to enable comparison among iso- 
forms, fold-change calculations were calibrated to a com- 
mon median expression level of DXS and HDR genes. In the 
pale-fleshed PS variety, CmDXSl was expressed in all organs 
analysed with the exception of roots. By contrast, transcripts 
for both type II DXS isoforms, CmDXSla and CmDXSlb, 
were most abundant in flowers (Fig. 4). The CmDXL isoform 
showed high transcript levels in all organs, including roots. In 
the case of HDR-encoding genes, CmHDRl expression was 
almost undetectable, whereas CmHDRl transcript levels were 
much higher than those of any DXS isoform in all organs 
tested (Fig. 4). The particularly high levels of CmHDRl 
transcripts in flowers, similar to that observed for CmDXS2a 
and CmDXSlb, suggest that plastidial isoprenoids pro- 
duced in melon flowers (mainly carotenoid pigments) require 



RIcinus communis; RcomDXS2: XP_002533688.1 R. communis; RcomDXSS: XP_0025 14364.1 R. communis; SbicDXS3: XP^002437810.1 Sorghum 
bicolour; ShabDXS2: AY687353 Solanum habrochaites; SmilDXSI : ACF21004.1 Salvia mlltiorrhlza; SmilDXS2: ACQ66107.1 S. miltlorrhlza; SlycDXSI : 
CAZ66648.1 Solanum lycoperslcum; SlycDXS2: CAZ66649.1 S. lycoperslcum; SrebDXS2: AJ429232 Stevia rebaudiana; TereDXS2: AF251020 
Tagetes erecta; VvinDXSI : XP_00227 1585.1 Vltis vinlfera; VvinDXS2: XP_002271 782.2 V. vinifera; VvinDXS3: XP_002277919.1 V. vinifera; ZmayDXSI : 
ACG27905.1 Zea mays; ZmayDXS2: DAA59892.1 Z. mays; ZmayDXS3: ABP88135.1 Z. mays. A class number was appended to sequences not 
originally annotated according to DXS class (type I, II, or III) according to the clade where they group in our analysis. Species with multiple type 2 DXS 
sequences are referred to here as DXS2a and DXS2b if not originally annotated as such. Bar, 0.1 amino acid substitutions per site. 
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Fig. 3. Phylogenetic analysis of plant HDR protein sequences. Sequences were aligned with MUSCLE (v3.7). Construction of the phylogenetic tree 
was based on the maximum likelihood method of the PhyML program {v3.0 aLRT). Graphical representation and edition of the phylogenetic tree 
were performed with Interactive Tree of Life (ITOL, itol.embl.de). Internal branching reliability (values not shown) was established using the aLRT test. 
Gymnosperm HDR1 (A) and HDR2 (B) clades are displayed in red and green, respectively. Cucurbit HDR2 (C) and HDR1 (D) clades are displayed 
in yellow and purple, respectively For those sequences not available on NCBI, the protein code in bold corresponds to the genome sequence 
annotation. The sequences used in the tree were as follows: AthaHDR: NP_^567965 Arabidopsis thaliana; AlyrHDR: XP_0028671 22 A. lyrata; BdisHDR: 
XP_003560190 Brachypodium distachyon; BrapHDRI : Bra034620 Brassica rapa; BrapHDR2: Bra011522 B. rapa; ClanHDRI : Cla015963 Citrullus 
lanatus; ClanHDR2: Cla010297 C. lanatus; CmelHDRI : MELO3C018407P1 Cucumis melo; CmelHDR2: MELO3C021949P1 C. melo; CpapHDR: 
CP00062G01010 Carica papaya; CsatHDRI : XP_004162740 Cucumis sativus; CsatHDR2: XP_004145400 C. sativus; CsinHDR: orange1.1g01 6939m 
Citrus sinensis; FvesHDR: XP_004289689 Fragaria vesca; GhirHDRI : Gorai.OOSGOl 9000.1 Gossypium hirsutum; GhirHDR2: Gorai.005G229100.1 
G. hirsutum; GhirHDRS: Gorai.013G052300.1 G. Iiirsutum; GbillDSI : ABB78088 Gini<go biioba; GbillDS2: ABB78090 G. biioba; GbillDS2-1 : ABB78089 
G. biioba; GmaxHDRI : XP^003537898 Giycine max; GmaxHDR2: XP„003541082 G. max; MacuHDRI : GSMUA^Achr8P04900_001 iViusa acuminata; 
MacuHDR2: GSMUA_Achr3P25710„001 M. acuminata; MdomHDRI: MDP0000762895 Ma/us domestica; MdomHDR2: MDP0000029783 
M. domestica; MescHDRI: cassava4.1_007197m /Wan//)otesc(j/enfa; MescHDR2: cassava4. 1^0071 71 m /W. escuienta; CsatHDRI: NP^001051167 
Oryza sativa; CsatHDR2: NP_001 051 1 68; PsitlDSI : ACN40284 Picea sitctiensis; PsitlDS2: ACN39959 P. sitcliensis; PdenHDRI : ACC54560 Pinus 
densiflora; PdenHDR2: ACC54561 R densiflora; PperHDR: EMJ01027 Prunus persica; PtaelDSI : AB026587 Pinus taeda; PtaelDS2: AB026588 
R taeda; PtriHDRI : XP_00230541 3 Popu/us fr/c/iocarpa; PtriHDR2: XP_00231381 6 R triciiocarpa; PvulHDR: Pvul.011G047900.1 Pliaseoius vuigaris; 
RcomHDR: XP_002519102 Ricinus communis; SbicHDR: Sb01g009140.1 Sorgtium bicoiour; SlycHDR: NP_001 234728 Soianum iycopersicum; 
SmoeHDR (outgroup): XP_002967048 Seiagineiia moeiiendorffii; StubHDR: PGSC0003DMP400043678 Soianum tuberosum; TcacHDR: ECX98548 
Tiieobroma cacao; VvinHDR: XP J02284659 Vitis vinifera; ZmalHDR: NP_001 1 69300 Zea mays. Bar, 0.01 amino acid substitutions per site. Inset, 
amino acid identity between full-length HDR homoeologues (black) from polyploid angiosperms, HDR paralogues from gymnosperms (white), or HDR 
paralogues from cucurbits (grey). 



CmHDRl and type II DXS isoforms at high levels. These 
expression data are consistent with organ-specific functional 
specialization of different isoforms of MEP-pathway genes. 
We compared this data to organ specific expression in the 
orange-fleshed melon variety Dul, a close relative of Ved. 
It resembled PS in most respects, such as expression levels 
of CmDXS2b in flower tissue far above that of leaf tissue 
(Fig. 4). CmHDR2 expression was absent in most tissues. 
While low compared to other DXS and HDR genes (Fig. 4), 
its expression in roots was nonetheless detectable. This stood 
in contrast to CmHDR2 expression in PS tissues, in which 
CmHDR2 transcripts were completely absent from all organs 
surveyed. 

De-etiolation (greening) is a major physiological transi- 
tion to photosynthetic development which requires copious 
amounts of isoprenoid precursors. We further investigated 



the differential expression of specific DXS and HDR iso- 
forms by examining their transcript levels in 4-day-old etio- 
lated seedHngs at various time points following exposure to 
continuous white light. Expression levels of CmDXSl and 
CmDXL were considerably higher than their counterparts 
CmDXSla and CmDXSlb both before and during light 
induction (Fig. 5), reaching approximately 6- and 24-fold 
higher transcript levels at 24 h (fully de-etiolated seedlings) 
relative to those at Oh (etiolated seedhngs). This observation 
suggests that CmDXSl is the main DXS isoform providing 
isoprenoid precursors for the transition to photosynthetic 
development, consistent with previously reported roles of 
type I DXS isoforms supplying isoprenoid equivalents for 
photosynthetic metaboHsm (Walter et ai, 2002). In the case 
of HDR, CmHDRl transcripts were the most abundant 
and the only ones that showed a substantial upregulation 
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Fig. 4. Relative transcript abundance of DXS and iHDR isoforms in organs 
of aduit PS (top) or Dui (bottom) piants. cDNA loading was normalized 
using three distinct reference genes (APT1 , ACT2/8, and CYC). RNA was 
extracted from three individual plants for each organ (n = 3) and assayed 
by quantitative RT-PCR using SYBR Green as three technical replicates 
each. Relative fold-change calculations were performed using the AACt 
model (Lival< and Schmittgen, 2001) by normalizing all measurements to 
the median gene expression in the leaf for all DXS and HDR genes. The 
dashed line signifies a discontinuity in the vertical axis. Error bars indicate 
the standard error. 

upon illumination (Fig. 5), although low levels of CmHDRl 
transcripts were also detected. CmHDRl supplies IPP and 
DMAPP for the production of isoprenoids in flowers (Fig. 4) 
and de-etiolating seedlings (Fig. 5), whereas CmHDR2 may 
only be expressed in specialized contexts. 
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Fig. 5. Transcript profiling of etiolated seedlings exposed to white light 
for 4, 8, or 24 h. For each time point, RNA was harvested from three 
individual seedlings and converted to cDNA as described in Materials and 
Methods. Quantitative RT-PCR was performed using SYBR Green assays, 
and cDNA loading was normalized using three separate normalizer genes 
(APT1 , ACT2/8, and CYC). Fold-change calculations were calibrated to 
the mean expression level of all DXS and HDR genes using the AACt 
method (Lival< and Schmittgen, 2001). Values shown are the averages of 
three independent replicate assays in three technical replicates. Error bars 
indicate the standard error. 



HDR1 and type II DXS isoforms are involved in the 
production of carotenoids in melon fruit 

To confirm whether specific isoforms might be involved 
in the production of MEP-derived isoprenoids in certain 
organs or fruit developmental stages, we next compared 
the expression of genes encoding DXS and HDR isoforms 
with the accumulation of different plastidial isoprenoid 
end-products during melon fruit development in PS and 
Ved varieties, which produce white and orange-fleshed 
(carotenoid-accumulating) fruits, respectively, when ripe. 
As shown in Fig. 6, analysis of MEP-derived isoprenoid 
end-products in developing melon fruit by HPLC-DAD- 
FLR revealed major differences between PS and Ved in the 
production of carotenoids (mainly p-carotene) and toco- 
pherols. Only low levels of (3-carotene were detected early in 
PS fruit development, and these decreased to barely detect- 
able levels towards fruit maturation and ripening (Fig. 6). 
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In contrast, fruit ripening in Ved was accompanied by a 
dramatic accumulation of (3-carotene, reaching 2.76 ±0.05 
mg-g ' DW by harvest. Compared to immature fruit 
(15 DAP), this corresponds to a nearly 80-fold increase. 
Chlorophyll a and h were present at low levels at earlier 
stages of fruit development of both strains and detected 
only intermittently thereafter. In ripe fruit, very low levels 
of chlorophylls remained in both varieties (Fig. 6). In the 
case of tocopherols, both a and y tocopherol increased as 
fruit developed but with different profiles in PS and Ved 
(Fig. 6). The level of tocopherols increased in parallel over 
time up to 15 DAP (a) or 25 DAP (y) in both varieties. 
From that stage, a-tocopherol modestly increased in ripen- 
ing PS fruit whereas y-tocopherol substantially increased 
in the Ved variety. As a result, ripe PS fruit accumulated 
roughly equal amounts of a- and y-tocopherol (5-6 ^ig-g ' 
DW) whereas Ved fruit contained almost three times more 
y-tocopherol than a-tocopherol (Fig. 6). In terms of total 
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Fig. 6. Time course of isoprenoid end-product accumulation in PS and 
Ved fruit. Developing melon fruit were sampled on the days indicated 
(three technical replicates from three individual fruits per time point) and 
extracted, isoprenoid metabolites in these extracts were separated 
by HPLC and quantified by comparing their DAD (for chiorophylis and 
carotenoids) or FLR (for tocopherols) signals to external calibration curves 
constructed with authentic standards. A chiorophyii b standard curve was 
applied to both chlorophylls and a-tocopheroi was similarly employed to 
quantify both tocopherols. Error bars indicated the standard error of three 
biological replicates extracted in triplicate. Recoveries were calculated 
based on individual recoveries of the internal standard canthaxanthin 
added during the initial extraction step. 



tocopherol content, Ved had approximately 50% more 
tocopherols than PS (16.4 vs 11.0 ^g-g ' DW). 

When the expression of DXS and HDR family genes was 
analysed in developing PS and Ved fruit, no induction of 
CmDXSl or CmHDRl was observed during ripening for 
either cultivar (Fig. 7). By contrast, the rest of the genes ana- 
lysed {CmDXS2a, CmDXSlh, CmDXL, and CmHDRl ) were 
upregulated during fruit development in Ved but not in PS. 
Because CmDXL was found to be inactive as a DXS enzyme 
(Fig. 2), we conclude that CmDXS2a, CmDXS2b, and 
CmHDRl are most likely the isoforms involved in supplying 
the precursors required not only for carotenoid (p-carotene) 
but also for y-tocopherol production during orange-fleshed 
Ved fruit ripening. 

To determine if cw-acting regulatory elements could 
account for the observed transcript differences between Ved 
and PS, we used whole genome re-sequence data to extract 
the promoter (2kb) and 3' regions (1 kb) of the these genes 
{CmDXSl a, CmDXS2b, and CmHDRl) for each variety 
(PS and Ved). We observed no differences in CmHDRl cis 
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Fig. 7. Transcript profiling in developing melon fruit. Quantitative RT-PCR 
SYBR Green assays were performed at different development fruit stages 
for PS (open circles) or Ved (filled squares) fruit. RNA was extracted from 
samples from three individual melons. cDNA loading was normalized 
using the CYC, APT1 , and ACT2/8 genes. Relative fold calculations were 
based on the median gene expression of ail DXS and HDR genes from a 
given variety using the AACt method (Livak and Schmittgen, 2001). Values 
shown are the averages of three biological replicates assayed in triplicate. 
Error bars represent the standard error 
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regulatory regions when PS and Ved were compared, imply- 
ing that regulation by trans-acting elements is at least partially 
responsible for the observed induction in Ved. We likewise 
observed no differences in the untranslated regions or pre- 
dicted transcriptional start sites of CmDXS2a or CmDXSlb 
when PS and Ved were compared. However, a comparison 
of the promoter regions between varieties revealed subtle 
differences for both CmDXSla and CmDXS2b, consist- 
ing of 12 SNPs and a 3-bp deletion in Ved in the case of 
CmDXS2a and 15 SNPs and a 3-bp deletion in Ved in the 
case of CmDXSlb. An analysis of known transcription fac- 
tor binding sites revealed many differences based on these 
nucleotides changes. A full listing of transcription factor 
binding site differences between varieties for these two genes 
is provided in Supplementary Table 2. BOXIINTPATPB 
and POLLEN 1LELAT52 represent the only motifs present 
in the promoter regions of both CmDXS2a and CmDXS2b 
of Ved that are absent in PS. While these differences could 
in principal account for the differences in transcriptional 
regulation of type II DXS genes between white- and orange- 
fleshed varieties, the lack of differences in cis elements of 
CmHDRl suggests that carotenoid formation in melon fruit 
is a complex process with multiple regulatory mechanisms 
at work. 

Discussion 

Melon DXS type I and II family members are 
functionally specialized for primary and secondary 
isoprenoid production, respectively 

DXS is represented as a small gene family in most plant 
species, and here we have investigated the roles of the four 
melon genes encoding proteins with high similarity to 
known DXS proteins: one type I (CmDXSl), two type II 
iCmDXS2a and CmDXS2b), and one type III (CmDXL). 
While CmDXSl, CmDXS2a, and CmDXS2b encode func- 
tional DXS proteins, CmDXL does not. A phylogenetic tree 
describing the evolutionary relationship of known plant 
DXS proteins shows that the melon DXS proteins group 
within their expected type I, II, or III (DXS-like) catego- 
ries rather than with themselves, in keeping with the general 
observation that duplication and functional specialization 
of these genes preceded the separation of the major plant 
lineages (Walter et al. , 2002). The phylogenetic evidence also 
suggests that distinct subclasses within type II DXS isoforms 
are a genuine feature within the type II DXS clade (Fig. 1 A- 
D). Based on the hierarchical organization of DXS protein 
sequences, the duplication of type II DXS genes may have 
occurred on multiple occasions. Unexpectedly, angiosperm 
sequences of clade A (Fig. 1) are more closely related to 
the gymnosperm type II proteins of clade C than to their 
own intraspecific paralogues, and a similar relationship for 
gymnosperm type II DXS paralogues exists. This could be 
explained by the duplication of type II DXS genes both 
before and after the separation of angiosperm and gymno- 
sperm lines. Additional research is necessary to determine 
whether clade B is an evolutionarily recent offshoot from 



the common branch giving rise to angiosperm and gymno- 
sperm type II DXS genes (Fig. 1, clades A and C). While 
transcript profiling did not suggest functional speciaHzation 
of CmDXS2a relative to CmDXS2h, the hierarchical group- 
ing of these sequences into classes, rather than forming their 
own clade, suggests functional divergence is under way. 

CmDXSl is induced by light and is highly expressed in 
leaves and flowers, which is consistent with a role in pro- 
viding isoprenoid equivalents for primary metabolism 
(i.e. photosynthesis). CmDXS2a and CmDXS2b, however, 
are primarily expressed in flower, ovaries, and developing 
fruit. Based on a time course to observe the accumula- 
tion of (3-carotene in Ved fruit following pollination, the 
rate of accumulation of this compound increased consid- 
erably after 15 DAP and then continued to increase until 
harvest (Fig. 6). The white-fleshed variety PS, in contrast, 
accumulates only trace levels of chlorophylls or carotenes 
(Fig. 6). CmDXS2a and CmDXS2b are the dominant DXS 
isoforms in developing fruit of orange varieties, and hence 
they are probably involved in providing precursors for the 
high levels of plastidial isoprenoids (including (3-carotene 
and, to a much lower extent, y-tocopherol) that accumu- 
late during ripening. In contrast, neither of these type II 
DXS transcripts is upregulated during fruit development 
in the PS variety which lacks (3-carotene, implying that 
elevated DXS activity is only necessary in varieties which 
accumulate high amounts of carotenoid end products. 
We identified two c«-regulatory elements present in the 
promoters of Ved CmDXS2a and CmDXS2b which were 
absent in the promoters of the corresponding PS genes: 
BOXIINTPATPB, a non-consensus type II promoter of 
plastid genes identified in tobacco (Kapoor and Sugiura, 
1999) and POLLEN1LELAT52, a motif required for pol- 
len-specific expression in tomato (Bate and Twell, 1998) 
(Supplementary Table 2). Curiously, carotenoid cleavage 
dioxygenase (CmCCDl), which produces volatile apoca- 
rotenoids from (5-carotene through a 9,10-oxidative cleav- 
age, is upregulated both in orange-fleshed varieties as well 
as in white varieties that accumulate neither carotenoids 
nor apocarotenoids (Ibdah et al, 2006). This suggests the 
transcriptional programme for (5-carotene catabolism is 
uncoupled from that of precursor production via the MEP 
pathway. 

Type II DXS isoforms in other plant species have been 
implicated in defensive oleoresin production (Phillips et al, 
2007), ginkgolide natural product formation (Kim et al, 
2006), and apocarotenoid production in association with 
arbuscular mycorrhizas (Walter et al , 2000), but this is the 
first report of a type II DXS involved in specialized isopre- 
noid biosynthesis in the fruit of an agronomically important 
crop species such as melon. Citrus, for instance, relies on a 
type I DXS for carotenoid formation during fruit ripening 
(Peng et al, 2013). In tomato, the best characterized system 
for fleshy fruit-producing agricultural crops, a type I DXS 
isoform also appears to supply the precursors for carotenoid 
biosynthesis during fruit ripening whereas the type II enzyme 
is expressed preferentially in trichomes (Lois et al, 2000; 
Paetzold et a/., 2010). 
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DXL expression correlates with isoprenoid biosynthesis 
but does not encode a functionai DXS protein 

CmDXL was induced during de-etiolation in Ved (Fig. 5) and 
during fruit ripening in Ved but not PS (Fig. 7). The failure to 
detect authentic DXS activity in CmDXL by two independ- 
ent methods, coupled to its highly upregulated transcripts 
in developing fruit and de-etiolating seedlings, suggests it 
encodes a protein with an activity different from that of DXS. 
Arahidopsis AtDXLl (a type I DXS) was found to be a non- 
essential gene which similarly lacks DXS activity (Carretero- 
Paulet et al, 2013). Unhke its functional paralogue CLAl/ 
DXSl, it is not expressed in leaves or induced by light and 
is mainly expressed in siliques and flowers. An evolutionary 
analysis of this plastid-localized protein suggested it had 
evolved recently through gene duplication of CLAl/DXSl 
in the Brassicaceae Hne (Carretero-Paulet et al., 2013). The 
additional observation that CmDXL suppresses endogenous 
DXS activity in in vitro assays of crude protein preparations 
(compared to non-transformed controls) suggests it may share 
a common substrate with DXS. In that event, CmDXL would 
effectively mask endogenous activity in vitro due to its affinity 
for a substrate required by DXS. An analysis of the CmDXL 
amino acid sequence shows that it contains many typical fea- 
tures of a DXS enzyme, including a thiamin diphosphate co- 
factor binding domain, a transketolase domain, and essential 
amino acids implicated in the DXS mechanism (His"^, 
Asp'^^ Glu^'^ Arg^''^ and Arg^**^; Fig. 8) (Xiang et al, 2007). 
However, amino acids 182-195 of CmDXL depart sharply 
from the conserved motif in the corresponding position of 
type I and II DXS sequences, and this motif is also diver- 
gent in maize ZmDXS3 and Arahidopsis AtDXL2 (Fig. 8). 
CmDXL also lacks a conserved HGVXKFPD motif present 
in DXS proteins of classes I and II (corresponding to amino 
acids 377-384 of CmDXSl). In another type III clade mem- 
ber, Arahidopsis DXL2, this motif is also absent, and in maize 
ZmDXS3 it is substituted by an unrelated sequence (Fig. 8). 
Based on the E. coli DXS 3D structure, the first residue of 
this motif (His^"'* in the E. coli protein) is predicted to par- 
ticipate in binding of glyceraldehyde 3-phosphate. Given the 
absence of this conserved residue in CmDXL, an inability 
to bind glyceraldehyde 3-phosphate could explain its lack of 
DXS activity. The expression of ZmDXS3 is linked to carot- 
enoid accumulation in maize endosperm (Vallabhaneni and 
Wurtzel, 2009), and a plasmid bearing this gene was capable 
of rescuing a Jx^-deficient E. coli strain when grown in liquid 
culture (Cordoba et al., 2011). However, this approach may 
also favour spontaneous recruitment of DXS activity from 
other genes (Sauret-Gueto et al, 2003; Sauret-Gueto et al., 
2006; Perez-Gil et al., 2012). Additional research is required 
to definitively assign a biochemical function to members of 
this clade. 



The mode ofHDR paralogue acquisition differs across 
plant lineages and Impacts subfunctionallzatlon 

The melon genome possesses two functional HDR genes, 
one of which, CmHDRl, is expressed at high levels in most 



tissues and developmental processes. CmHDR2 transcripts 
were only detected in de-etiolating cotyledons and in roots 
of orange-fleshed varieties. However, a similar pattern of 
transcript expression in young leaves and roots was reported 
for Ginkgo biloha GhHDR2 (Kang et al, 2013). This sup- 
ported previous transcript profiling of MEP-pathway genes 
in cultured gingko embryos (Kim et al., 2008) and Pinus 
densijiora (Kim et al. , 2009) which suggested a speciaHzed 
role for gymnosperm HDR2 proteins in specialized isopre- 
noid biosynthesis; for example, gingkolide biosynthesis in 
roots of G. biloha and diterpenoid resin acid biosynthesis in 
pine species. Phylogenetic analysis also supported the exist- 
ence of subclades of HDR which reflect an ancient func- 
tional specialization of HDR paralogs (Kim et al, 2008). 
We observed that the ancient division yielding HDR para- 
logues in modern gymnosperms (Fig. 3 A and B) described 
by Kim et al. (2008) was not fully preserved in angiosperms. 
Instead, members of HDR clade B share a common ances- 
tor with all angiospermous HDR sequences. Further divi- 
sions of this group have occurred independently in some 
flowering plant families, leading to divergent HDR copies 
in the Cucurbitaceae (Fig. 3C and D). However, this anal- 
ysis also revealed many angiosperm lines, such as apple, 
soybean, rape, cotton, banana, and poplar, with two highly 
similar copies of HDR (Fig. 3 and inset) and no indica- 
tion of functional divergence which could be inferred by 
sequence relatedness. Unlike the clear hierarchical divi- 
sion of DXS types, the phylogeny of HDR suggests at least 
two distinct legacies for HDR paralogue acquisition are 
represented here. 

We considered the role of large-scale genome duplica- 
tion events in the duplication of MEP-pathway genes and 
observed an absolute correlation between species with highly 
similar HDR paralogues, including banana, soybean, poplar, 
rape, and apple, and a history of polyploidy. In these spe- 
cies, HDR paralogues most likely represent homoeologous 
sequences produced by large scale or WGD events. On the 
other hand, HDR paralogues which arose in species with no 
recent record of genome duplications, such as the cucurbits 
(Garcia-Mas et al, 2012) and gymnosperms (Nystedt et al, 
2013), are probably products of tandem or segmental dupH- 
cations. These small-scale duplications can lead to subfunc- 
tionalization more rapidly than duplications derived from 
WGD events (Flagel and Wendel, 2009). The duplication- 
degeneration-complementation (DDC) model of subfunc- 
tionalization posits that duplicates arising from an ancestral 
gene with multiple functions (including multiple expression 
domains) may become quickly specialized by accumulating 
mutations which partition the ancestral functions such that 
both dupHcates acquire essential roles (Force et al, 1999), 
resulting in the preservation of both genes. Many models of 
gene duplication and evolution have been proposed, and mul- 
tiple modes may act in concert to preserve duplicate genes 
(Innan and Kondrashov, 2010). For example, escape from 
adaptive conflict (EAC) (Des Marais and Rausher, 2008) 
could also explain the origin of specialized HDR paralogues 
if multiple functions performed by the original gene could 
not be improved upon independently. While a combination of 
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Fig. 8. Alignment of selected type I and II DXS sequences. The alignment was created using BLOSSUI\/162 with a gap-opening penalty of 10 and 
a gap-extension penalty of 0.05. It includes the presumed mature peptide only (transit peptides were manually curated based on an alignment 
of the full-length peptide). Residues shaded in black are absolutely conserved while those in grey are conserved among 70% of sequences 
analysed. The solid bar indicates a deletion in type III DXS sequences. Asterisks indicate residues identified as critical for catalysis in the E. coli 
protein (Xiang, et al. 2007). The first two letters of each sequence indicates the corresponding species: Aa, Artemisia annua; Mt, Medicago 
truncatula; Cm, Cucumis melo; Zm, Zea mays; At, Arabidopsis tiialiana; Pa, Picea abies; Ec, Escherichia coli; Cr, Catharanthus roseus; 
CmDXSS, melon DXL 
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subfunctionalization, DDC, and EAC may explain retention 
of specialized HDR paralogues in species without a history 
of WGDs, the existence of such homoeologues in banana, 
apple, poplar, rape, and soybean is perhaps more difficult to 
explain considering WGD should result in multiple copies of 
upstream steps as well. 

The observation that only the first and last steps of the 
MEP pathway are frequently encoded in the genome as gene 
families may be explained by gene dosage imbalance. Briefly, 
this hypothesis maintains that stoichiometric imbalances in 
macromolecular complexes can lead to dosage-dependent 
phenotypes (Veitia et al, 2008). Gene balance can deter- 
mine which genes are preferentially retained after single 
gene dupHcations or WGD events. Overexpression (via gene 
duphcation) may be harmful if that gene product forms a 
multi-subunit complex. A high number of subunits in a com- 
plex increases the probability that the members are encoded 
by singleton genes (Papp et al., 2003; Yang et al., 2003). In 
such cases, WGDs are followed by gene loss and pseudogeni- 
zation. The existence of multi-enzyme complexes formed by 
enzymes of the MEP pathway has not been studied in plants. 
However, in some bacterial species, central steps of the MEP 
pathway (but not DXS or HDR) do form such complexes 
(Gabrielsen et al., 2004; Lherbet et al., 2006). If similar com- 
plexes exist in plants, gene dosage balance may be responsi- 
ble for the loss of multiple copies of these central steps while 
multiple copies of DXS and HDR are retained in some (but 
not all) cases. 

Here we have described the role of specialized isoforms 
encoding regulated steps of the MEP pathway in melon 
and provided evidence for their functional specialization 
in seedling greening and fruit development. Understanding 
the regulation of carotene accumulation in melon fruit flesh 
is important for developing new tools for breeding pro- 
grammes in this species, and we have identified CmDXS2a, 
CmDXS2b, and CmHDRl as genes important for fruit col- 
our and quahty. We have also demonstrated that CmDXL, 
a member of the type III DXS clade, is not a genuine DXS. 
These results contribute to a better understanding of evo- 
lution of the MEP pathway in the following ways. First, 
our findings indicate that in addition to an ancient sepa- 
ration of DXS into functional classes, further duplications 
and divergence within the type II clade have taken place. 
Second, we have shown that the history of HDR gene duph- 
cation is very different from that of DXS. We identified two 
disparate modes of HDR duplicate acquisition in plants: 
ancient small-scale gene duplications in the cases of cucur- 
bits and gymnosperms and more recent polyploidy in the 
case of other flowering plant families. These differences may 
partially explain differences in the isoprenoid regulatory 
programmes of extant plants. The evolutionary history of 
a plant species may have included WGDs, gene loss, pseu- 
dogenization, single gene duplications, and functional spe- 
cialization, and here we have explored the impact of such 
events on the regulation of the MEP pathway in melon. 
Continued research in this area will provide new insights 
into the regulatory programme for the biosynthesis of this 
important class of natural products. 
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